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Abstract 



In this paper, we analyze the coding delay and the average coding delay of random linear network codes (a.k.a. 
dense codes) and chunked codes (CC), which are an attractive alternative to dense codes due to their lower complexity, 
over line networks with Bernoulli losses and deterministic regular or Poisson transmissions. Our results, which include 
upper bounds on the delay and the average delay, are (i) for dense codes, in some cases more general, and in some 

^v^j other cases tighter, than the existing bounds, and provide a more clear picture of the speed of convergence of dense 

codes to the (min-cut) capacity of line networks; and (ii) the first of their kind for CC over networks with such 

r — H probabilistic traffics. In particular, these results demonstrate that a stand-alone CC or a precoded CC provide a 

better tradeoff between the computational complexity and the convergence speed to the network capacity over the 
probabilistic traffics compared to arbitrary deterministic traffics which have previously been studied in the literature. 

>• I. Introduction 

Random linear network codes (a.k.a. dense codes) achieve the min-cut capacity over various network scenarios, 

o _ 

e.g., unicast over line networks, but at the cost of a rather high computational complexity [1|. Targeting the design 
of more computationally efficient network codes, Maymounkov et al. [2| proposed chunked codes (CC), which 

o 

generalize dense codes, and operate by partitioning the message of the source into non-overlapping (disjoint) 



sub-messages of equal size, called chunks [2]. Recently, a generalized version of chunked codes, referred to as 
overlapped chunked codes (OCC), were also independently proposed in |5) and (4j. It has been analytically shown 
in [5 1 that, for sufficiently small chunks, OCC provide a better tradeoff between the speed of convergence to (achieve 
or approach) the min-cut capacity and the message or packet error rate, compared to CC, over line networks with 
arbitrary deterministic traffics. This is while earlier in Q it was demonstrated that CC provide a better tradeoff 
between the speed of convergence to the min-cut capacity and the message error rate for sufficiently large chunks 
(also see (6) for more details). In this paper, our focus is on chunked codes. The extension of the analysis to OCC 
is not straightforward and is beyond the scope of this work. In chunked coding, each node at each transmission 
time randomly chooses a chunk, and transmits it by using a dense code. In fact, a dense code is a CC with only 
one chunk of the message size. Thus, CC require less complex coding operations due to applying coding on chunks 

f A preliminary version of this work has been presented partly in NetCod 2012, Cambridge, MA, USA, June 2012, and in part in ISIT 2012, 
Cambridge, MA, USA, July 2012. 
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smaller than the original message. This however comes at the cost of lower speed of convergence to the min-cut 
capacity compared to dense codes. 

The speed of convergence of dense codes and chunked codes to the min-cut capacity of line networks with 
arbitrary deterministic traffics (with deterministic transmission schedules and loss models) was studied in Q, |6j. 
It is not however straightforward to apply the results to the networks with probabilistic traffics. In particular, it has 
been shown that for arbitrary deterministic traffics (i) a dense code always achieves the capacity; (ii) a CC achieves 
the min-cut capacity, so long as the size of the chunks is lower bounded by a function super-logarithmic in the 
message size and super-log-cubic in the network length, and (iii) a CC, preceded by a capacity-achieving erasure 
code, approaches the min-cut capacity with an arbitrarily small but non-zero constant gap, so long as the size of 
the chunks is lower bounded by a function constant in the message size and log-cubic in the network length. 

Aside from the results for arbitrary deterministic traffics, Lun et al. [ 1 1 showed that dense codes achieve the 
min-cut capacity of line networks with probabilistic traffics specified by stochastic processes with bounded average 
rate. They however did not discuss the speed of convergence of such codes to the min-cut capacity. This issue was 
later studied in j7j, (8j, by analyzing the coding delajQand the average coding delaj^jof dense codes over some 
probabilistic traffics. There is however no result on CC over the networks with probabilistic traffics in the literature. 

Pakzad et al. 17) , for the first time, studied the average coding delay of dense codes (operating in F2) over line 
networks with deterministic regular transmissions and Bernoulli losses, where the special case of two identical links 
in tandem was considered. The analysis however did not provide any insight about how the coding delay (which is 
random with respect to both the codes and the traffics) can deviate from the average coding delay (which is random 
with respect to the codes but not the traffics). 

More recently, Dikaliotis et al. [8| studied both the average coding delay and the coding delay of dense codes 
(operating in a finite field of infinitely large size) over the line networks of arbitrary length with traffics similar to 
those in |7J, but under the assumption that there exists a unique worst link (i.e., a unique link with the minimum 
probability of transmission success) in the network. 

In this paper, we study the coding delay and the average coding delay of dense codes, and for the first time, 
chunked codes for different ranges of the chunk sizes, operating in the field of size two (F 2 ), over line networks 
with traffics similar to those in (T], |7J, JSJ. Our study has no limiting assumption on the traffic parameters or the 
length of the network. It is worth noting that any upper bound on the coding delay or on the average coding delay 
of any coding scheme over F 2 serves as an upper bound for the underlying code over any finite field of larger 
size. The method of analysis in this paper is itself, however, generalizable to finite fields of larger size, but the 
generalization is not trivial and is beyond the scope of this paper. 

'The coding delay of a code over a network with a given traffic (schedule of transmissions and losses) is the minimum time that the code 
takes to transmit all the message vectors from the source to the sink. The coding delay is a random variable due to the randomness in both the 
code and the traffic. 

2 The average coding delay of a code with respect to a class of traffics is the coding delay of the code averaged out over all the traffics (but 
not the codes), and hence is a random variable due to the randomness in the code. 
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The main contributions of this paper are: 

• We derive upper bounds on the coding delay and the average coding delay of a dense code, or a CC alone, or 
a CC with precoding, in the asymptotic setting, i.e., as the message size tends to infinity, over the traffics with 
deterministic regular transmissions or Poisson transmissions and Bernoulli losses with arbitrary parameters]^] 
The upper bounds are functions of the message size, the length of the network, and the parameters of the traffic 
and the code. We also consider a special case with unequal traffic parameters, where no two links have equal 
traffic parameters. The upper bounds, in this case, indicate how the coding delay or the average coding delay 
change as a function of the minimum of the (absolute value of the) difference between the traffic parameters 
of any two consecutive links in the network. 

• We show that: (i) our upper bounds on the average coding delay of dense codes are in some cases more general, 
and in some other cases tighter, than what were presented in |7), JSJ, and (ii) the coding delay of dense codes 
may have a large deviation from the average coding delay in both cases of identical and non-identical links; 
for non-identical links, our upper bound on such a deviation is smaller than what was previously shown in [8|. 
It is noteworthy that, for identical links, upper bounding such a deviation has been an open problem (see [8|). 

• We also show that: (i) a CC achieves the min-cut capacity, so long as the size of the chunks is bounded from 
below by a function super-logarithmic in the message size and super-log-linear in the network length, and 
(ii) the combination of a CC and a capacity-achieving erasure code approaches the min-cut capacity with an 
arbitrarily small non-zero constant gap, so long as the size of the chunks is bounded from below by a function 
constant in the message size and log-linear in the network length. The lower bounds in both cases are smaller 
than those over the networks with arbitrary deterministic traffics. Thus both coding schemes (i.e., stand-alone 
CC and CC with precoding) are less computationally complex (require smaller chunks), for the same speed 
of convergence (with or without a gap) to the min-cut capacity, over such probabilistic traffics, compared to 
arbitrary deterministic traffics. 

• In a capacity-achieving scenario, for such probabilistic traffics, we show that for CC: (i) the upper bound on 
the overhead (the difference between the coding delay and the min-cut capacitjQl grows sub-log-linearly with 
the message size and the network length, and decays sub-linearly with the size of the chunks, and (ii) the upper 
bound on the average overhead (the difference between the average coding delay and the min-cut capacity) 
grows sub-log-linearly (or poly-log-linearly) with the message size, and sub-log-linearly (or log-linearly) with 
the network length, and decays sub-linearly (or linearly) with the size of the chunks, in the case with arbitrary 
(or unequal) traffic parameters. For arbitrary deterministic traffics, the upper bound on the overhead or that 
on the average overhead was shown in |6| to be similar to the case (i), mentioned above, but with a larger 
(super-linear) growth rate with the network length. 

3 The scenario of deterministic regular transmissions and Bernoulli losses has been studied in 0, 0, and the scenario of Poisson transmissions 
with Bernoulli losses has been studied in fT) as a special case of the probabilistic traffics over line networks. 
4 For the definition of min-cut capacity used here, see Section II-A 
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This paper is an extended version of our earlier works (9), flO) , and contains more details on the methodology 
of the analysis and the details of the proofs. 

II. Network Model and Problem Setup 

A. Transmission and Loss Model 

We consider a unicast problem (one-source one-sink) over a line network with L links connecting L + 1 nodes 
{vi}o<i<L in tandem. The source node t> has a message of k vectors, called message vectors, from a vector space 
T over Fa, and the sink node vl requires all the message vectors. 

Each (non-sink) node at each transmission time transmits a (coded) packet, which is a vector in T . The packet 
transmissions are assumed to occur in discrete-time, and the transmission times over different links are assumed to 
follow independent stochastic processes. The transmission times over the i th link are specified by (i) a deterministic 
process where there is a packet transmission at each time instant, or (ii) a Poisson process with parameter A^ : < 
Aj < 1, where A,; is the average number of transmissions per time unit over the i th link. The transmission schedules 
resulting from (i) and (ii) are referred to as deterministic regular and Poisson, respectively. 

Each transmitted packet either succeeds (successful packet) or fails (lost packet) to be received. The successful 
packets are assumed to arrive with zero delay, and the lost packets will never arrive. The packets are assumed to be 
successful independently over different links. The successful packets over the i th link are specified by a Bernoulli 
process with (success) parameter pi : < pi < 1, where pi is the average number of successes per transmission 
over the i th link. The loss model defined as above is referred to as Bernoulli. 

For each traffic model as above, the parameters {p,} or {\i,Pi} are called the traffic parameters. In the case 
of traffics with parameters {pi} or {\i,Pi], the min-cut capacity is defined as the ratio of the message size k 
to the minimum (equivalent) traffic parameter mini<i<£j>j or vam.x<i<L \Ph respectively. For simplifying the 
terminology, hereafter, we refer to the "min-cut capacity" as the "capacity." 

B. Assumptions 

We assume that there is no feedback information in the network before the time that the sink node recovers all 
the message packets. Whenever the decoding process is successful, the sink node Uj, sends an acknowledge message 
to the node The node then stops transmitting new packets to the sink node, and relays the acknowledge 

message to the node Vl-2- The feedback relaying process continues over the links till the time that the source node 
vq receives the acknowledge message, and stops transmitting new packets. The feedback transmissions are assumed 
to be error-free and with zero delay. 

We also assume that the size of the memory at the network nodes is unbounded, i.e., all the packets, received 
by a node, will remain in the memory of that node till the end of the transmission time. 

C. Problem Setup 

The goal in this paper is to derive upper bounds on the coding delay and the average coding delay of dense codes 
and chunked codes over line networks with deterministic regular or Poisson transmissions and Bernoulli losses. 
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For some fixed < e < 1, the coding delay of a class of codes over a network with a class of traffics is 
upper bounded by N with failure probability (w.f.p.) bounded above by (b.a.b.) e, so long as the coding delay of 
a randomly chosen code over the network with a randomly chosen traffic is larger than N with probability (w.p.) 
b.a.b. e. The average coding delay of a class of codes over a network with respect to a class of traffics is upper 
bounded by N w.f.p. b.a.b. e, so long as the average coding delay of a randomly chosen code over the network 
with respect to the class of traffics is larger than N w.p. b.a.b. e. 

D. Asymptotic Notations 

Throughout the paper, we will use the asymptotic notations 0(.), o(.), and u>(.) defined as follows. For 
non-negative functions f(n) and g(n), we write: (i) f(n) — 0(g{n)) if and only if lim sup,^^ < oo; (ii) 
f{n) = o{g{n)) if and only if lim^^ = 0; (iii) f{n) = Sl{g{n)) if and only if limsup n ^. 0O > q- (i v ) 
fin) = u(g(ri)) if and only if lim^oo = oo; and (v) f(n) ~ g(n) if and only if lim^oo = 1. 

III. Deterministic Regular Transmissions and Bernoulli Losses 

In this section, for each coding scheme, we first consider arbitrary traffic parameters {pi}; and next, we consider 
a special case with unequal traffic parameters. 

A. Dense Codes 

In a dense coding scheme, the source node, at each transmission opportunity, transmits a packet by randomly 
linearly combining the message vectors, and each non-source non-sink (interior) node transmits a packet by randomly 
linearly combining its previously received packets. The vector of coefficients of the linear combination associated 
with a packet is called the local encoding vector of the packet, and the vector of the coefficients representing the 
mapping between the message vectors and a coded packet is called the global encoding vector of the packet. The 
global encoding vector of each packet is assumed to be included in the packet header. The sink node can recover 
all the message vectors as long as it receives an innovative collection of packets (with linearly independent global 
encoding vectors) of the size equal to the number of message vectors at the source node. 

The first step in our analysis is to lower bound the size of a maximal collection of packets at any non-source 
node until a certain time, referred to as the decoding time, where the entries of the global encoding vectors of 
the packets in the collection are independent and uniformly distributed (i.u.d.) Bernoulli random variables. Such 
packets are called the globally dense packets. Based on the result of [6 Lemma 1], the size of a maximal collection 
of globally dense packets at a node can be lower bounded by the number of packets with linearly independent 
local encoding vectors at that node. With a slight abuse of terminology, the packets with linearly independent local 
encoding vectors are called the dense packets. (By the above argument, the set of dense packets at each node is a 
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subset of the globally dense packets at that nodej^J The set of dense packets are of main importance in our analysis. 
In particular, by studying the linear dependence/independence of the local encoding vectors of the successful packets 
over a link, the number of dense packets, and further, the size of a maximal collection of globally dense packets, 
at the receiving node of that link can be lower bounded. We, next, upper bound the decoding time such that the 
probability that the underlying collection fails to include an innovative sub-collection of a sufficiently large size 
(equal to the message size) is upper bounded (this probability upper bounds the probability of the failure of a dense 
code to recover all the message packets till the underlying decoding time). 

Let Oi (li) be the set of labels of the successful (i.e., not lost) packets transmitted (received) by the i th node 
and let T>i be the set of labels of the dense packets at the i th node. Let Qi+i and Qi, with entries over F2, be 
the decoding matrice^] at the [i + l) th and i th nodes, respectively, and Tj, the transfer matrix at the i th node, be 
a matrix over F2 such that Qj+i = TiQi. The rows of Tj are the local encoding vectors of the successful packets 
transmitted by the i th node, i.e., (Ti) n j = \ n ,j, Vn 6 0, and Vj € li, where A„ is the local encoding vector of the 
successful packet. Let Qi, the modified decoding matrix at the i th node, be Qi restricted to its rows pertaining 
to the global encoding vectors of the dense packets. Let %, the modified transfer matrix at the i th node, be a matrix 
over F 2 such that Qi+\ = TiQi, i.e., 

(Ti) n ,j = A n j + ^ -W7*,j)Vn G Oi,Vj €T>i 
ieii\Vi 

and {j£,j} are in F 2 satisfying J2j<=T> 1l,i^3,k — ^i,ki Vfc € Zj. The n th row of % indicates the labels of dense 
packets at the i th node which contribute to the n th successful packet sent by the i th node, and the j th column of Tj 
indicates the labels of successful packets sent by the i th node to which the f h dense packet contributes. 

Let Q be a matrix over F 2 . The density of Q, denoted by T>{Q), is the size of a maximal dense collection of 
rows in Q, where a collection of rows is dense if the rows have all i.u.d. entries over F 2 . Further, Q is called a 
dense matrix if all its rows form a dense collection. Let T be a matrix over F 2 . The rank of T, denoted by rank(T), 
is the size of a maximal collection of linearly independent rows in T over F 2 . 

Lemma 1: Let Q be a dense matrix over F 2 , and T be a matrix over F 2 , where the number of rows in Q and 
the number of columns in T are equal. If rank(T) > 7, then T>(TQ) > 7. 

Proof: The proof can be found in JS). □ 

Since = TiQi, and Qi is densej^jby applying the result of Lemma [l] it follows that can be 

bounded from below so long as rank(T, ) is bounded from below. The rank of the modified transfer matrix T, is a 

5 One should however note that the local encoding vectors being linearly independent (i.e., forming a "dense" collection of packets) is not a 
necessary condition for the packets to form a "globally dense" collection. In particular, the collection of all the packets successfully transmitted 
by the source node is globally dense (by the definition) but some packets in this collection might have local encoding vectors linearly dependent 
on those of the rest (and hence such packets do not belong to the dense collection of the packets successfully transmitted by the source node). 

6 The global encoding vectors of the received packets at a node form the rows of the decoding matrix at that node. 

7 The rows in Qi are the global encoding vectors of the dense packets at the i' h node, and based on an earlier argument, the set of dense 
packets at a node belong to the set of globally dense packets at that node. Thus, the entries of all the rows in Qi are i.u.d. over F2. 
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function of the structure of T i7 and the structure of such a matrix depends on the number of dense packet arrivals 
at the i th node and the number of successful packet departures from the i th node before or after any given point in 
time. Such parameters depend on the traffic over the i th and (i + l) th links, and are therefore random variables. It is 
however not straightforward to find the distribution of such random variables. We thus use a probabilistic technique 
as follows to study such variables. 

Let (0, Nt] be the period of time over which the transmissions occur (Nt is the decoding time). We split the 
time interval (0,Nt] into w disjoint subintervals (partitions) of length Nt/w. The first partition represents the 
time interval (0, Nt/w}; the second partition represents the time interval (Nt/w, 2Nt/w], and so forth. For every 
1 < i < L and 1 < j < w, all the arrivals over the i lh link in the first j partitions, i.e., in the time interval 
(0, jNt/w], occur before any departure over the (i + l) th link in the (j + l) th partition, i.e., in the time interval 
[jNt/w, (j + 1)Nt/w]. Thus the number of arrivals at the i th node before any given point in time within the 
(j + l) th partition is bounded from below by the sum of the number of arrivals at this node in the first j partitions. 

This method of counting is however suboptimal since there might be some extra arrivals in the (j + l) th partition, 
which arrive before the given point in time within this partition. To control the impact of sub-optimality, the length 
of partitions needs to be chosen with some care. To be specific, the length of partitions, on the one hand, needs to 
be sufficiently small such that there is not a large number of arrivals in one partition compared to the total number 
of arrivals in all the partitions. This should be the case so that ignoring a subset of arrivals in one partition does 
not result in a significant difference in the number of arrivals before each point in time within the same partition. 
On the other hand, the partitions need to be long enough such that the deviation of the number of arrivals from the 
expectation in one partition is negligible in comparison with the expectation itself. This ensures the validity of our 
analysis and the tightness of our results. 

Let Iij represent the j th partition pertaining to the i lh link for every i and j. We focus on the set of all the packets 
over the i lh link in the active partitions pertaining to this link, where, for every i, j, 1^ is an active partition if and 
only if i < j < w — L + i. Such a partition is active in the sense that (i) there exists some other partition over the 
upper link so that all its packets arrive before the departure of any packet in the underlying active partition, and (ii) 
there exists some other partition over the lower link so that all its packets depart after the arrival of any packet in 
the underlying active partition. In particular, the first w — L + 1 partitions pertaining to the first link are all active; 
the w — L + 1 partitions pertaining to the second link starting from the second partition are all active and so forth. 
Let Wt represent the total number of active partitions pertaining to all the links, i.e., 

w T = L(w - L + 1). (1) 

We start off with lower bounding the number of successful packets in all the wt active partitions. Let 1^ be an 
active partition, and (fij be the number of (successful) packets in Jy. Since the length of the partition 1^ is Nt/w, 
and by the assumption the packet successes over the i lh link follow a Bernoulli process with the parameter p i7 ipij 
is a binomial random variable with the expected value ipi = piN T /w. Let 

p= mm Pi, (2) 

l<z<L 
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and 



ip = pNt/w. 



(3) 



For any real number x, let x denote | . By applying the Chernoff bound, one can show that tp^ is not larger than 
or equal to 



w.p. b.a.b. e/wx, so long as < 7* < 1 is chosen such that r is an integer, and 7* goes to as Nt goes to infinity, 
where 



For all suppose that ip^j is larger than or equal to r. This assumption fails if the number of packets in some 
active partition is less than r. Hence, the failure occurs w.p. b.a.b. e. 

Next, for every 1 < i < L and 1 < j < w — L + 1, we lower bound the number of dense packets in the first 
j active partitions over the i th link. Before explaining the lower bounding technique in detail, let us introduce two 
lemmas which will be useful to lower bound the rank of the modified transfer matrix at each node (depending on 
whether the number of dense packet arrivals at the i lh node over the i th link in a given partition is larger or smaller 
than the number of packet departures from that node over the (t + l) th link in the partition with the same index as 
the underlying partition pertaining to the i th link). 

Let w*, r* and {rf}i<i<: w * be arbitrary non-negative integers, and let rjj, ax = maxi<;< w * r ; * and r^ in = 
mmi<i< w * r ; *. For any pair such that 1 < j' < i' < w*, let Ty^i be an r* x r*, dense matrix over 

F 2 ; for any other pair let T^ji be an arbitrary r* x r*, matrix over F 2 j^Let T = ^T%>j']i<i',j><w*- The 

matrix T is called random block lower-triangular (RBLT) (see Figure [TJl. 

Lemma 2: Let T be an RBLT matrix with parameters w*, r* and {r ; * : < r ; * < r*}i<;< lu ». Let n* — 
YIkkw* r i- ^ or ever Y integer < 7 < n* — 1, 



where u* = \(n* - 7)/r^J. 

Proof: For any integer < 7 < n* — 1, let T' be T restricted to its first n* — 7 columns. Since T' is an 
w*r* x (n*— 7) sub-matrix of T, Pr{rank(T) < n* — 7} < Pr{rank(T') < n* — 7}. Suppose thatrank(T') < n*— 7. 
Then there exists a nonzero column vector v of length n* — 7 over F 2 such that the column vector T'v of length 
w*r* is an all-zero vector. For a given integer 1 < j < n* — 7, suppose that the first non-zero entry of v is 
the j th . There exists 2™ such vectors. Let us define Tq = for convenience. Let r be an integer satisfying 

Eo<Kr r i <j< Eo<i< T+ i r h and Tmax be an integer satisfying 2~2 <i< rmxl r i < n * - 7 < Eo<i<r m »+i r i- B Y 
the definition, it follows that < r max < min{w*, u* — 1}. It should not be hard to see that r and r max are unique. 

8 For any pair (i 1 such that 1 < i' < j' < w* , the entries of Tj/ ji might be dependent on the entries of TJj« j«, for any other pair 
such that 1 < j" < i" < w* . 
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(4) 




(5) 




Fig. 1. The structure of a random block lower-triangular (RBLT) matrix T with parameters w*,r* and {r ; * }kk w *. The shaded blocks 
represent the dense sub-matrices of T with i.u.d. Bernoulli entries, and the blank blocks represent those sub-matrices of T with arbitrarily 
dependent or independent entries with respect to the entries of the dense sub-matrices of T. 



For every t < fmax? define s* — ^o<«<r ^i' The column of T' has at least (t^* — 7~)r* i.u.d. Bernoulli 
entries, and hence the vector T'v has at least (w* — r)r* i.u.d. Bernoulli entries. Thus, T'v is all-zero w.p. b.a.b. 
2~ 7+ ™ ~ w r J2i<j< n *--y 2 rr _-7 '> noting that r depends on j. We rewrite the sum as: 

E 2 ^'+ E 2r *~ J + 

0<j<sj s*<j<s* 

...-)- 2 Timr "~ : > = 

^ 2^+2 r *-^ ^ 2~ J + 

0<j<r* 0<j<r* 

1_ 2 r ™« r *- s x m « V" 2~ J < 

0<j<n*- 7 - S * m!ix 

^ 2- J '+2 r *- s i 2 ~ 3 + 

0<3<r^ 0<j<r* ax 

h 2 Tm » r *" s ^„ ^ 2^' = 

0<j<rS ax 

^ 2~ j 2 T ' r *-<' < 

0<3<r^ 0<T'<r mllx 

Y 2 ~ j E 2 (r *~ r ™"- )r ' = 
0<j<r^ 0<r'<r mai 

(l-2 _r ^) 2 (r *" r »"» )T '. 

0<r'<r max 

The series X)o<T'<T milx -^ r *~ r "^ r converges from below to (r max + l)2( r *~ r mi„) T m» if r * — r ^ in goes to infinity. 
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Thus the following is always true: (1 - 2- r ^) E„< tI<t „ 2(r * _r "" )T ' < (w + 1)(1 - 2~ r *™*)2( r * - r *^ T ™ < 
u*(l - 2- r ™*)2( r *- r min)( u *- 1 ). This proves the lemma. □ 
Lemma 3: Let T be an RBLT matrix with parameters w*, r* and {r ; * : < r* < r1}i<i< w *. Let n* = w*r*. 
For every integer < 7 < n* — 1, 

Pr{rank(T) < n* - 7} < u* (l - 2~ r *) 2^ l+n ' ~ w ' r ^+^-r* 



where u* = |"(n* - 7)/r*~|. 

Proof: We start the proof by noting that T has a smaller number of rows than columns, and the minimum 
number of rows and columns gives an upper bound on the rank of the matrix. Let T' be T restricted to its last 
n* - 7 rows. For every < r < w*, define s* = J2o<j<w-r r j ■ Thus, T is of size (n* - 7) x Sq. Suppose 
that there exists a nonzero row vector u of length n* — - 7 whose entries are over F2, and its first nonzero entry is 
the j th , and the row vector vT' is all-zero. There are 2" such vectors. Let r be the largest integer smaller 

than j/r* . The j th row of T" has at least s* i.u.d. Bernoulli entries, and hence the vector vT' has at least s* i.u.d. 
Bernoulli entries. Thus, vT' is all-zero w.p. b.a.b. 2~ 7+n * ^2 1<j<n ,_~ 2~ J ~ S *. By definition, s* > (ty* - r)r^j n , 
and the preceding sum can thus be upper bounded as follows: Xa<j<n*-7 2~ j ~ St < Xa<j<n*- 7 2~i~( w ~ r ) r min. 
The latter sum can be rewritten itself as: 

0<j<r* r*<j<2r* 

. . . _|_ \^ 2 _:,_ ^* _ "* +:L ' r "™ = 

(u* — l)r* <j<ti* — 7 

2-"-*Cn ^ 2 _j + 2- {w "- 1)r ^- r " ^2 2 ~ j + 

0<j<r* 0<j<r* 
_ . . _|_ 2-(w*-l)r^ n -(«*-l)r* ^ '< 

0<j'<n* — 7— («* — l)r* 

2-« , * r min ^ 2"^ ^ 2* r "™ _r ^ T ' = 

0<j<r* 0<t'<m*-1 

(1 - 2- , '*)2~ u, * r ™ 2 (r »'» _r * )r '. 

0<T'<U*-1 

The last sum is bounded from above by 2( r "™~ r *^ u *~ 1 ">u*, and this completes the proof. □ 
Now, for every 1 < i < L and 1 < j < w — L + 1, we explain how to lower bound the number of dense 
packets in the first j active partitions over the i lh link. The lower bounding technique works in a recursive manner 
as follows: 

For every 1 < I < j, suppose that the number of dense packets in the first I active partitions over the (i — l) th 
link is lower bounded. Let T- be the modified transfer matrix at the I th node, restricted to the successful packet 
transmissions within the first j active partitions over the I th link (by the assumption, the number of such packets 
in each partition is bounded from below by r). Then, one can see that the matrix T- includes a sub-matrix T" 
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with a structure similar to that in Lemma [2] or the one in Lemma This can be seen precisely by the following 
replacements in Lemma [2] or Lemma [3] (i) w* with j (i.e., the number of underlying active partitions), (ii) r* with 
r (i.e., the lower bound on the number of successful packet transmissions in each of the first j active partitions 
pertaining to the i th link), and (iii) r*, for every 1 < I < w*, with the difference between the two lower bounds on 
the number of dense packets in the first I and the first I — 1 active partitions pertaining to the (i — l) th link (note 
that r\ is equal to the lower bound on the number of dense packets in the first active partition). 

The lower bounding process then proceeds as follows. Each successful packet in any of the first j active partitions, 
say the m th active partition, for some 1 < m < j, pertaining to the i th link can be written as a linear combination 
of the dense packets in the I th active partition pertaining to the (i — l) th link, for all 1 < I < m, and perhaps some 
extra dense packets in the (m+ l) th active partition pertaining to the (i— l) th link. Thus, for every 1 < I < m, each 
row of the sub-matrix T[_ x l (in the matrix T') indicates the labels of (some subset ofj^jthe dense packets in the 
I th active partition pertaining to the (i — l) th link which contribute to the linear combination of one packet (from 
the set of the r chosen successful packets) in the m th active partition pertaining to the i th link; and each column of 
T[_-y j indicates the labels of the successful packets to which one dense packet (from the set of the r; chosen dense 
packets) in the I th active partition pertaining to the (i — l) th link contributes. For any other m < I < w, the set of 
dense packets in the I th active partition pertaining to the (i — l) th link which contribute to the linear combination 
of one packet in the m th active partition pertaining to the i* link is not tractable in our analysis. Hence, the rows 
(or the columns) of each sub-matrix T[_ x t , for such values of / (i.e., m < I < w), might have independent or 
dependent entries (over Fj) with respect to the entries of the rows (or the columns) of the sub-matrices in the set 
of {T!_ 1 ;}i</< m . Next, by applying the proper lemma, the rank of the modified transfer matrix at the i th node, 
and finally, by applying Lemma [I] the number of dense packets in the first j active partitions over the z th link can 
be bounded from below. This completes the lower bounding process. 

Note that, because of its recursive nature, the above technique lower bounds the number of dense packets in 
the first j active partitions over the i th link as a function of the number of dense packets in the active partitions 
pertaining to the first link. Further, the packets over the first link are all globally dense (by the definition), and 
hence by using the recursion, the required results can be derived as follows. 

Let T>(Qj) be the number of "globally dense" packets in the first j active partitions over the i th link. By the 
definition, T>(Ql) is bounded from below by the number of "dense" packets in the first j active partitions over the 



i link. Let T> V {Q\) be a (probabilistic) lower bound on the number of dense packets in the first j active partitions 



'in the case of identical links, the modified transfer matrix at each node includes a sub-matrix similar to that in Lemmal^ However, in the 
case of non-identical links, depending on the traffic parameters, the modified transfer matrix at a node might include a sub-matrix similar to 
that in Lemma |5J or the one in Lemma |3j 

10 It is worth noting that there might be a number of dense packets which contribute to the linear combination of some packet transmission, 
but are not included in our lower bounding analysis. The exclusion of such (dense) packets weakens the tightness of the results, but does not 
affect the correctness of the analysis. 
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over the i th link, and of course a lower bound on £>(Qj)p]such that: if V{Q T S ) > V p (Q T s ), for every 1 < s < i and 
1 < T < j> except (s, r) = then the inequality T>(Qj) > T) V (Q\) fails w.p. b.a.b. e/wx- Let f%j be defined 

in a recursive fashion as the largest integer satisfying 

h 3 < v P (Qi) - (6) 

i<r<i 

Note that, at each step of our lower bounding process, the number of dense packets in a collection of active 
partitions, but not the number of dense packets in one individual active partition, is lower bounded. Further, the 
difference between the two lower bounds corresponding to the two collections of the first j and the first j — 1 
active partitions does not lower bound the number of dense packets in the j lh active partition. However, due to the 
recursion, we need to choose a certain number of dense packets at each step of the process (and ignore the rest, 
if any), and study the density of the packets in the next partition, at the next step of the process, with respect to 
the dense packets chosen till the previous step. We, thus, construct a collection of dense packets at the i th node as 
follows: starting with an empty collection (at the step zero), for every 1 < j < w — L + 1, at the j th step, we expose 
the packets in the active partitions over the i th link in order, one by one. We add a packet to the collection whenever 
the packet is dense (with respect to the current collection), until revealing fij new dense packets. The size of such 
a collection lower bounds the number of dense packets at the i th node; and in order to study the structure of the 
modified transfer matrix at this node, we fix the packets in the subsets of the underlying collection (each subset 
pertaining to one of the collection steps) and ignore the rest of packets. 

The set of packets over the first link are globally dense, and hence, for every 1 < j < w — L + 1, T>(Q J l ) > rj 
(by the assumption, each partition includes more than or equal to r packets). Further, for every 1 < i < L and 
1 < i < w — L + l, T>{Q J i ) is bounded from below as follows. 

Lemma 4: Consider applying a dense code over a line network of L links with deterministic regular transmissions 
and Bernoulli losses with parameters {pi}. Let wt and r be defined as in ([TJ and respectively. For every 
1 < i < L, the inequality 

T>(Ql) > r - log(tWr/e) - 1 

fails w.p. b.a.b. ie/wr- 

Proof: Fix 1 < i < L, Let Tj_i be the modified transfer matrix at the starting node of the i th link. Let 
be Ti-i restricted to the packets in the first active partition over the i th link. For every 1 < s < i, suppose 
V{Q\) > T> P (Ql), where V p (Ql) = r - log(w T /e) - 1, and V(Q\) = V p {Q\) = r. Then, by replacing w* ,r* 
and {ri}i<i< w * with l,r and fi, respectively, in Lemma |2j where f\ = fi-i^j^Jone can see that T£_i includes 
an r x f\ dense sub-matrix. Thus by applying Lemma [2] for every < 7 < f\ ~ 1, Pr{rank(T' i 1 _ 1 ) < f\ — 7} < 
it(l - 2- fl )2-i +f ' 1 - r + ir - fl K u - 1 \ where u = \(h - f)/f{\- Thus, Pr{rank(f; 1 _ 1 ) < (1 - 2~ fl )2~i+^- r , since 
u = 1. Taking 7 = log(«>r/e) + fx — r + 1, it follows that Pr{rank(T i 1 _ 1 ) < f\ — 7} < c/wt- By Lemma [T| 

"fp(Qf ) is a "probabilistic" lower bound on D(Q^) and hence the subscript "p." 
12 We often drop the subscript i in the notation rjj when there is no danger of confusion. 
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T>(Qj) < r — log(wT/ e ) — 1 W -P- b.a.b. c/wt- Thus, T> p (Q\) = r — log^^A) — 1. Taking a union bound over 
the first i links, T>(Q\) < r — log(Wr/e) — 1 w.p. b.a.b. ie/wr- □ 
Lemma 5: Consider a scenario similar to the one in Lemma [4] Let p be defined as in (|2). For every 1 < i < L 
and 1 < j < w — L+l, the inequality 

!>(<#)> rj-Ai 

fails w.p. b.a.b. ije/wx, so long as 

ui log — = o(pNt) (7) 

where = j(l + o(l))(log(w T /e) + l) + log((j(l + o(l)) + l)/e)+logw T + l, and the o(l) term is (log(w T /e) + 
l)/r. 

Proof: Fix 1 < i < L. For every 1 < s < i and 1 < r < j, except (s, r) = (i, j), suppose T>(Q T S ) > V p (Q T s ), 
where X>p(QJ) = rr - r(l + o(l))(log(w T /e) + 1) - log((r(l + o(l)) + l)/e) - log w T - 1, and the o(l) term 
is (log(wy/e) + 1)/V, and V(Q\) > T> p (Ql), where 2? p (Qs) = r — log(Wr/e) — 1. Let f T = fj_i )T , for every 
1 < T < j> ^min = min r f T , and f max = max T r T . Let and n = r D p (Q 3 i _ 1 ) = J2i<r<j ?r- Let us define Tj_i as in the 
proof of Lemma Q Let T^_ x be Tj_i restricted to the packets in the first j active partitions over the i th link. Then, by 
replacing w* , r* and {r ; *}i<;< lu * with j, r and {fV}l<T<j> respectively, in Lemma|2] one can see that T?^ includes 
an rj x n sub-matrix with a structure similar to the matrix T as in Lemma [2] Thus by applying Lemma [2j for every 
< 7 < n - 1, Pr{rank(f/l 1 ) < n -7} < u(l - 2- f ">»)2-^+"-^'+( r ^'™}(«-i), where u = ["(n- 7)/rW|. It is 
not difficult to see that, by our method of collecting the dense packets, it follows that f m ; n = fx. Further by applying 
Lemma [4] r x = 2> p (Q*_i) = r - log(w T /e) - 1. Thus, u < \rj/f{\ = \{l + o(l))j] < (1 + o(l))j + 1, since 
fi = r(l — o(l)), given log(u>rA) = °{ r ), where the o(l) term is (log(iDTA) + l)/ r - Since r ~ 93 = pNt/wt, 
the latter condition can be written as u>log(wr/e) = o(j?Nt)- Taking 7 = n — rj + (1 + o(l))j(log(wr/e) + 
1) + log(((l + o(l))j + l)/e) + logw T + 1, it follows that Pr{rank(T/l 1 ) < n - 7} < e/w T - Now, by applying 
Lemma [TJ T>(Qj) < n — 7 w.p. b.a.b. e/wx- Thus, T> p (Q{) = n ~ 7. Taking a union bound over the first j active 
partitions of the first i links, T>(Q{) < rj - (1 + o(l))j(log(tOr/e) + 1) - log(((l + o(l))j + l)/e) - logwr - 1 
w.p. b.a.b. ije/wT, where the o(l) term is (log(wy/e) + l)/r. This completes the proof. □ 

The result of Lemma [5] lower bounds the number of dense packets at the sink node, T>{Ql), as follows. 

Lemma 6: Consider a scenario similar to the one in Lemma |4] Let p and ip be defined as in |2]) and ((3), 
respectively. The inequality 

£>(Ql) > w T p/L~ w T p/Lyf{l/<p) log(w T /e) 

- (wt/L) log(w>r/e) — (wt/Lip) log 2 (wr/e) 

- (w T /Ltp) \og(w T /e) -log(w T /e) 

- log(u> T /L) - 1 (8) 

fails w.p. b.a.b. e, so long as 

it) log — = o{pNt) 
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where w ~ (pN T L 2 /\og(pN T L/e)) 3 . 

Proof: For the ease of exposition, let v = Wt/L, Lemma|5]gives a lower bound on V(Q V L ). Thus, we can write: 
V(Q L ) > V(Ql) > rv-v(l + o(l)) (\og(w T /e) + 1) - log((t>(l + o(l)))/e) - log w T - 1, where the o(l) term is 
(log(wT /e)) /r. This bound fails w.p. b.a.b. e, given the assumption that the number of packets in each active partition 
is larger than or equal to r. Since this assumption fails w.p. b.a.b. e, the lower bound on T>(Ql) fails w.p. b.a.b. e. 



Further, r = (1 — o(l))tp, where the o(l) term is yj(l/<p) h^wr/e). Thus, T>{Qi) > ^pv ~ o(ipv) — iilog(u>T/e) — 
v — o(ulog(w>r/e)) — o(u) — log(u/e) — logwy — 1 fails w.p. b.a.b. e, where o(ipv) ~ 0((pv^J log(wT/e))> 
and o(u) ~ (w/y) log(tOr/e). By considering the dominant terms, the right-hand side of the last inequality can be 
written as 

ipv - 0(vy / (p\og(w T /e)) - 0(vlog(w T /e)). (9) 
We now replace ip and v by pA^T /w and w (w ~ w), respectively, and rewrite ^ as 

pN T - 0(pN T L/w) - 

0(y/pN T w\og(wL/e)) - 0{w\og{wL/e)), (10) 
by using the fact that wt is 0{wL). We select w to be 



3/ PN T L 2 
V \og(pN T L/e) 

in order to maximize ( fT0] > subject to condition Q. This choice of w ensures that each 0{.) term in ( fT0| ) is o(pNx), 
and hence the coding scheme is capacity-achieving. □ 
Let riT be equal to the right-hand side of inequality ([8]). Thus, Ql fails to include an jit x fc dense sub-matrix 
w.p. b.a.b. e. 

Lemma 7: Let Q be an n x fc (k < n) dense matrix over Fj. Then, Pr{rank(Q) < fc} < 2~("~ fc ). 

Proof: The proof can be found in |j6j. □ 

By applying Lemma [7j Pr{rank(Q^) < k} is b.a.b. e, so long as fc < — log(l/e). By replacing e with e, it 
follows that the sink node fails to recover all the message vectors w.p. b.a.b. e, so long as k < tit — log(l/e) — 1. 
In the asymptotic setting, as Nt goes to infinity, ut can be written as 



pN T - (1 + o(l))(pN T L/w + \JpN T w\og(wL/e) + w\og(wL/e)). 

We rewrite the last inequality as 

k < pN T - (1 + o(l))(pN T L/w 

+ \/pNtw \og(wL/e) + w\og(wL/e)) — log(l/e) — 1. 

Let fc max be the largest integer k satisfying this inequality. Thus, fc max ~ pNx, and by replacing Nx with k/p 
(Nt ~ k/p), the following result is immediate. 
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Theorem 1: The coding delay of a dense code over a line network of L links with deterministic regular trans- 
missions and Bernoulli losses with parameters {pi} is larger than 



w.p. b.a.b. e, so long as 



i (*+ (l + o(l)) l^ + Jk^lozf 



w log = o(k) 




where w ~ (kL 2 /log(fcL / 'e)J 3 , p = mini<i<LPi, and the o(l) term goes to as k goes to infinity 

We now study the average coding delay of dense codes over the traffics with deterministic regular transmissions 
and Bernoulli losses. In this case, the deviation of the number of packets per partition should not be taken into 
account. Thus, by replacing r with <p in the analysis of the coding delay, the following result can be shown. 
Theorem 2: The average coding delay of a dense code over a network similar to Theorem [T] is larger than 



w.p. b.a.b. e, so long as 



If, , , . . / kL , wL 

- [k + (1 + o(l)) h w log — 

p \ \ w e 



w log = o(fc) 



where u> ~ (fcL/log(fcL/e)) 3 . 

Proof: The proof follows the same line as that of Theorem [T] except that r needs to be replaced with tp in 
the proof of Lemma ^ Thus, the 0(v\J <p\og(wT / e)) term in |9]) and the 0(yJ pN^w log (wL/e)) term in ( |T0] > 
disappear. Then, it should not be hard to see that the choice of w needs to maximize 

P N T - 0(pN T L/w) - 0(wlog(wL/e)), (11) 

instead of (JTOj, subject to condition Q. This can be done by selecting w to be 



pN T L 

□ 



log(p7V T L/e) ■ 



The choice of w in Theorem [2] is much larger than the one in Theorem [T] This is because, in this case, there is 
no gap between the lower bound on the number of packet transmissions in each partition and its expectation; and 
hence, the partitions do not need to be sufficiently long. 

It is worth noting that the preceding results might not provide a very clear picture of how the coding delay or the 
average coding delay are related to the traffic parameters of the links other than the one(s) with the minimum traffic 
parameter. However, by applying our analysis technique, while taking into consideration the actual values (and the 
ordering) of the traffic parameters of the links, new upper bounds (with more details) on the coding delay and the 
average coding delay can be derived. To be more specific, in such an analysis, for every 1 < i < L, depending 
on whether the i th or the (i + l) th link has a larger traffic parameter, either Lemma [2] or Lemma [3] can be used to 

l3 In the rest of the theorems, the o(l) term is defined similarly. 
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lower bound the rank of the modified transfer matrix at the i node, respectively. The rest of the analysis, however, 
remains the same. For example, the coding delay and the average coding delay of dense codes for the special case 
with unequal traffic parameters, where no two parameters are equal, can be upper bounded as follows. In particular, 
the upper bounds, in this case, demonstrate the dependence of the coding delay or the average coding delay on the 
minimum of the (absolute value of the) difference between the traffic parameters of any two consecutive links in 
the network. 

Let us assume pi > p2 > ■ ■ ■ > pl, without loss of generality. Let p = mini<.;<£Pi, 7 e = mini<i<L7 ei , and 
7 e . = \pi -pi-x\. Let n = (1 - j*)<Pi, where ip l = p l N T /w and 7* ~ y/(l/ipi) log(w T /e). For every 1 < i < L 
and 1 < j < w — L + 1, let <pij be defined as before (i.e., ipij is the number of successful packets in the j th active 
partition pertaining to the i th link). For all suppose that ip^j is larger than or equal to r.- L , i.e., there exist a 
sufficiently large number of successful packet transmissions in each partition over each link. (This assumption fails 
if, for some 1 < i < L, the number of packets in some active partition over the z th link is less than r^. Hence, the 
failure occurs w.p. b.a.b. e.) 

Since all the packet transmissions over the first link are globally dense, for every 1 < j < w — L+l, T>(Q\) > r\j. 
Further, by applying Lemma [3] it can be shown that, for every 1 < i < L and 1 < j < w — L + 1, the inequality 
f (Ql) > fij fails w.p. b.a.b. ije/wr, so long as 



e V IP)/ 
Let p, ip, 7* and r denote p^, (p^, j1 and r^, respectively. Thus, the inequality T>{Ql) > (1 — j*)ipwt/L fails 

w.p. b.a.b. e. By replacing (p with pNx/w, the right-hand side of the last inequality can be written as: 



choice of w needs to satisfy condition ( |12) , instead of condition Q. 

Theorem 3: Consider a sequence of unequal parameters {pi]\<i<L- The coding delay of a dense code over a 
line network of L links with deterministic regular transmissions and Bernoulli losses with parameters {pi} is larger 
than , 




(12) 



pN T - 0{pN T L/w) - 0(yfpN T w log(wL/e) ) . 



(13) 



The rest of the analysis is similar to that of Theorem [T] except that ( fTJI excludes the last term in ( fTO] ), and the 





choice of w needs to maximize 



pN T - 0{pN T L/w) 



(14) 



subject to condition ( |12) , instead of ( fTTj i subject to condition (J7]). 
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Theorem 4: The average coding delay of a dense code over a network similar to Theorem [3] is larger than 

- (k + l + o 1 — 
w.p. b.a.b. e, so long as 

i wL ( ■ h« A A 

w log = o I mm | — , 1 > ■ k I , 

i.e., u> ~ k/(f(k) log(kL/e)), and /(fc) goes to infinity, as fc goes to infinity, such that f(k) = o (k/(L\og(kL/e))). 

B. Chunked Codes 

In a chunked coding scheme, the set of k message vectors at the source node is divided into q disjoint subsets, 
called chunks, each of size a — k/q. The source node, at each transmission time, chooses a chunk independently at 
random, and transmits a packet by randomly linearly combining the message vectors belonging to the underlying 
chunkf^Each non-source non-sink node, at the time of each transmission, chooses a chunk independently at random, 
and transmits a packet by randomly linearly combining its previously received packets pertaining to the underlying 
chunk. The sink node can decode a chunk, so long as it receives an innovative collection of packets pertaining to 
the underlying chunk of a size equal to the size of the chunk. 

1 ) Capacity-Achieving Scenarios: In a CC, at each transmission time, a chunk is chosen w.p. 1 /q, and a packet 
transmission over the z th link is successful w.p. pj. Thus the probability that a given packet transmission over the 
i th link is successful and pertains to a given chunk is pi j q. Thus by replacing p. L with Pi/q in the analysis of dense 
codes in Section |III-A| the coding delay and the average coding delay of CC in a capacity-achieving scenario will 
be upper bounded. 

The results of dense codes are indeed a special case of those of CC with one chunk of size k. It is, however, 
worth noting that, due to the change in the parameters, the number of partitions w needs to satisfy a new condition: 
wq log = o(pNt) or wq log = o ^min j ^ , 1 j • pNj^j , instead of condition |7) or ( |12) , in the proofs of 
Theorems [5] and [6] or those of Theorems [7] and [8] respectively. Further, by replacing w with its optimal choice 
in the new version of (JTOj, (JTTJ, ( fT3| ) and ( p"4] l, each 0(.) term needs to be o(pA^/g) in order to ensure that CC 
are capacity-achieving in the underlying case. Such a condition lower bounds the size of chunks (a) by a function 
super-logarithmic in the message size (k). 

Theorem 5: The coding delay of a CC with q chunks over a line network of L links with deterministic regular 
transmissions and Bernoulli losses with parameters {pi} is larger than 



P 

w.p. b.a.b. e, so long as 




1 ' k + (1 + o(l)) [ ^ + \jk ^wqlog^j^J +wq\og^k 



q = o(k/(L\og(kL/e))), 



14 The "random" scheduling of the chunks and the "random" coding within the chunks have been shown to perform effectively when the 
feedback information is not available at the network nodes j6]. However, for cases with feedback, more efficient scheduling policies and coding 
schemes have been proposed in the literature (e.g., see II II). 
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and 



1 wLc l (u\ 
wq log = o(k) 

e 



where w ~ (kL 2 j \q\og(kL j 'e))) 3 , and p = mirii<i<L pi. 

Proof: The proof follows the same line as that of Theorem [TJ by implementing the following modifications. 
Let us replace p and e with p/q and e/q, respectively. Then, ip — pNT/wq, and r = (1 — "f*)(f, where 7* ~ 
\J (1/V) \n(wTq/e)- For every 1 < i < L, and 1 < j < w — L + 1, let T>(Q 3 i ), T> p (Q 3 i ), and be defined as 
in Section |III-A but only restricted to the packets pertaining to a given chunk (not all the chunks). For every 



i, j, T>(Qi) can be lower bounded as follows (the proofs are very similar to those of Lemmas |4] and pj: for every 
1 < j < w — L + 1, T>(Q\) > rj, and for every 1 < i < L and 1 < j < w — L + l, T>(Q J i ) fails to be larger than 
r i ~ i(l + °(1)) log(wT9/e), w.p. b.a.b. ije/wrq, so long as 

WTq 

wqlog = o(pNt). (15) 

Thus the number of dense packets pertaining to a given chunk at the sink node fails to be larger than 

pN T Q / pN T L 
q \ wq 



O U^log^f -oLlog^f) (16, 



w.p. b.a.b. e/q. In order to maximize ( [To} subject to condition ([15), we select w to be 

3/ PN T L 2 

Y glog(p^ T i/e)' 

Now let us assume that Nt is (1 + o(\))k/p. By replacing e with e, in the preceding results, and by replacing k 
and e with fc/g and e/q, respectively, in Lemma [7] it follows that the sink node fails to decode a given chunk w.p. 
b.a.b. e/q, so long as Nt is larger than 



Uk+(1 + 0(1)) + (wqlog ^) + ^log W -f j j . (17) 

Taking a union bound over all the chunks, it follows that the sink node fails to decode all the chunks w.p. b.a.b. e, 
so long as Nt is larger than {11) . To ensure that the lower bound on Nt is (1 + o(l))k/p, all the terms in (17) , 
excluding the first one, need to be o(k/p). This condition is met so long as q is 

k \ 

□ 



L\og(kL/e) 



Theorem 6: The average coding delay of a CC with q chunks over a network similar to Theorem [5] is larger than 

1 '* + (! + 0(1)) ( k -±+wqlog WqL 



p \ \ w e 

w.p. b.a.b. e, so long as 

q = o(k/(L\og{kL/e))), 
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and 



1 wLc l (u\ 
wq log = o(k) 

e 



where w ~ (kh / (q\og(kL/ e))) 5 . 

Proof: The proof is similar to that of Theorem [5] except that r needs to be replaced with tp. This implies that 
the third term in ( [To} disappears. Thus, by selecting w to be 



pN T L 



Y qlog(pN T L/e) 

in order to maximize a new version of fTo} (i.e., where the third term in ( [To} is excluded), subject to condition ([15), 
it follows that the sink node fails to decode all the chunks w.p. b.a.b. e, so long as is larger than 

i(fc + (l + (l))(^+^log^)). (18) 

The rest of the proof follows that of Theorem [5] □ 
In the case of unequal traffic parameters, the coding delay and the average coding delay are upper bounded as 
follows. 

Theorem 7: The coding delay of a CC with q chunks over a line network of L links with deterministic regular 
transmissions and Bernoulli losses with unequal parameters {pi} is larger than 



i t + (l + o,l), ^Uog^) 



, = o[min<^,l^t/(ilog(ti/«))], 



w.p. b.a.b. e, so long as 

q = o ( min \ 

P 

where w — {kL 2 / (q\og(kL/e))) 3 ,p = um\i<i< L Pi, J e = miriK^L j e ., and 7^ = \p t - 

Proof: By replacing p and e with p/g and e/q, respectively, in the proof of Theorem [3] it follows that the 
number of dense packets pertaining to a given chunk at the sink node fails to be larger than 

pNx q ( pNxL 



q \ wq 



oU^log^j (19) 

w.p. b.a.b. e/q, so long as 

wqlog^y- =ofmin j^,lj-pA/rV (20) 

The rest of the proof is similar to that of Theorem [5] except that ( fT9] l excludes the last term in ( [To} , and the choice 
of w needs to satisfy condition (20) , instead of condition ( |15) , By selecting w to be 



pN T L 2 



q\og(pN T L/e) 
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in order to maximize (jT9j subject to condition ( |2"0] >, it follows that the sink node fails to decode all the chunks w.p. 
b.a.b. e, so long as Nt is larger than 



\ (* + (! + 0(1)) (v + ^^^^f) 



In ( pi) , each term, except the largest one, needs to be o(k/p), and this condition is met so long as q is 



(21) 



o I min { — , 1 1 • — — I . U 



p' J L\og(kL/e) 

Theorem 8: The average coding delay of a CC with q chunks over a network similar to Theorem [7] is larger than 

- (k+ (1 + (1)) _ 
P \ \w 



w.p. b.a.b. e, so long as 

<1 



= o (min{^,lj • fc/(/(fc)Llog(fc£A))) 



where w ^ k/ (qf(k) \og(kL/e)), and /(fc) goes to infinity, as k goes to infinity, such that f(k) — o (k/(L \og(kL/e))). 
Proof: The proof follows the same line as that of Theorem [5] except that the choice of w needs to maximize 

PN T f P N T L\ 



q \ wq 

subject to condition ( |20] i. To do so, we select w to be 

pN T 



qf{pN T )\og{pN T L/ey 

where f(n) goes to infinity, as n goes to infinity, such that f(n) — o (n/(L\og(nL/e))). The sink node fails to 
decode all the chunks w.p. b.a.b. e, so long as Nj- is larger than 

L ^ + (1 + 0(1))^^. (23) 



p \ \ w 

The second term in (|23]l needs to be o(k/p), and this condition is met so long as q is 



o [ min i — , 1 1 • , , — . . . I . U 



p' \ f(k)L\og(kL/e) 



2) Capacity-Approaching-with-a-Gap Scenarios: By the results of Section III-B1 one can conclude that CC are 



not capacity-achieving if the size of the chunks does not comply with condition a = u>(L\og(kL/e)). Also, the 



analysis of Section III-A does not apply to CC with chunks of small sizes violating the above condition. From 
a computational complexity perspective, CC with chunks of smaller sizes are, however, of more practical interest 
(e.g., linear-time CC with constant-size chunks). In the following, we study CC with chunks of a size constant in 
the message size. 
Let { Pi } 

i<i<L be an arbitrary sequence of traffic parameters, and let p — mini<j<£ pi. Let the size of the chunks 
a (= k/q) be a constant in the message size k, i.e., a = 0(1). Let the time interval (0, Nt] and its w disjoint 



partitions be defined as in Section III-A Let ipij be the number of packets (pertaining to a given chunk) in the 
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partition Jy, and tp^ be the expected value of ipij. Let ip = mini<j<£ <p>i. Then, ^ = piNr/wq, and <y9 = pNr/wq. 
Let ./Vt = (1 + j c )k/p, where < j c < 1 is an arbitrarily small constant. By replacing Nt with (1 + -f c )k/p, it 
follows that ip = (l+j c )a/w. Further, it is not hard to see that <p — O(l), since w has to be a constant (otherwise, 
if w goes to infinity, as Nt goes to infinity, then <p goes to 0, and for such a case, our analysis is not valid). 

By applying the Chernoff bound, it can be shown that Pr{tpij < (1 — 7*)</?} < e~ 7 2 ^, for every < 7* < 1. 
Taking e" 7 ^ < jb/^T, it follows that <pij is not larger than or equal to r = (1 — 7*)</J w.p. b.a.b. jb/wx, where 
7* is chosen to be the smallest real number larger than or equal to y/(l/<p) l^wr/ib) such that r (= (1 — 7*)0 
is an integer. It is not hard to see that 7* = O(l). Taking a union bound over all the active partitions of all links, 
it follows that ipij is not larger than or equal to r w.p. b.a.b. jb- 

Let T>(Ql) be the number of dense packets pertaining to a given chunk in the first j active partitions over the 
i th link. 

By applying Lemma [3] it can be shown that: (i) for every 1 < j < w — L + 1, V(Q{) > rj, (ii) for every 
1 < i < L, the inequality T>(Qj) > r — log(wr/7&) fails w.p. b.a.b. i^b/wr, and (iii) for every 1 < i < L and 
1 < j < w — L + 1, the inequality T>{Ql) > r — j log(wr/7b) — log((j + l)wT/ib) fails w.p. b.a.b. ijib/wT, so 
long as 

a = n (w 2 log y-^j . (24) 

By using the above results, it follows that the number of dense packets pertaining to a given chunk at the sink 
node fails to be lower bounded by 

"**-()( ^/^g^) - O feog ?*) (25) 



L \ L y 7 6 J \ L j b J 

w.p. b.a.b. jb- The lower bound is non-negative so long as a — Q(wlog(wT/7b))> and this condition holds so 
long as condition |24| holds. We select w to be y/ aL 2 /\og(aL / jb) to maximize ( |25] l. By replacing w with this 
value, (124b can be rewritten as 



ft^Vlog-^. (26) 



By replacing 7^, with 75, and by applying Lemma [7] it follows that the sink node fails to decode a given chunk 
w.p. b.a.b. 7b, so long as ( |2"5j ) is larger than a + log(l/7b). By replacing our choice of w in ( |25] >, it can be seen 
that, excluding the first term, the second term dominates the rest. By replacing ip with (1 + j c )a/w, the decoding 
condition becomes 

a = n( L ,\og— ). (27) 

V7c 7b7c/ 

Thus, a given chunk fails to be decodable w.p. b.a.b. jb so long as both conditions ([26]) and ( p7| ) are met. In other 
words, the expected fraction of undecodable chunks is bounded from above by 7b. By using a martingale argument 
similar to the one in |6| (by constructing a martingale sequence over the number of undecodable chunks), the 
concentration of the fraction of undecodable chunks around the expectation can be shown as follows. The proof is 
omitted to avoid repetition. 
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Lemma 8: By applying a CC with chunks of size a, satisfying both conditions ( |2"6| ) and ((27]), the fraction of 
undecodable chunks at the sink node until time Nt = (1 + 7 c )fc/p is larger than (1 + j a )jb, w.p. b.a.b. e, so long 

as 

« 2 /7a7 b 2 = o(fc/log(l/e)), (28) 

where < j a , t&, 7 C < 1 are arbitrary constants. 

By the result of Lemma [8] the fraction of chunks which are not decodable until time Nt becomes larger than 
(1 + 7 a )7b, w.p. b.a.b. e. Since 7a, 7b are non-zero constants, a CC, alone, might not decode all the chunks. 
However, the completion of decoding of all the chunks is guaranteed by devising a proper precoding scheme [6|. 
The precoding works as follows: The set of k message vectors at the source node constitute the input of a capacity- 
achieving erasure code, called precode. The rate of the precode is 1 — (1 + 7 a )7b, i.e., the precode decoder can 
correct up to a fraction (1 + 7 a )7b of erasures^] and the number of the coded packets at the output of the precode, 
called intermediate packets, is (l + (1 + 7 a )7b + 0(7& )) k. By applying a CC with chunks of size a, satisfying 
conditions ( |2o*] i, ( |27] i and ( |28| i, the fraction of the intermediate packets that are not recoverable at the output of 
the CC decoder until time (1 + 7 C ) (l + (1 + 7 a )7b + 0(7?)) § is larger than (1 + 7 Q )7b, w.p. b.a.b. e. Then, the 
precode decoder can recover all the k message vectors from the set of recovered intermediate packets. Therefore, 
the coding delay of a CC with precoding (CCP) is upper bounded as follows. 

Theorem 9: The coding delay of a CCP with chunks of size a and a capacity-achieving erasure code of rate 
1 — (1 + 7a)7b, over a line network of L links with deterministic regular transmissions and Bernoulli losses with 
parameters {p^ is larger than (1 + j c ) (l + (1 + 70)75 + 0(7?)) ~, w.p. b.a.b. e, so long as 

\l\7c 7b7c/ V 7b/ 
and a 2 /jalb ~ °(k/log(l/e)), where < 7 a ,7b,7e < 1 are arbitrary constants, and p = mhii<i<£Pi. 

In the case of the average coding delay of a CC with precoding, the following can be shown similar to Theorem [9] 
by replacing r with cp. 

Theorem 10: The average coding delay of a CCP with chunks of size a and a capacity-achieving erasure code 
of rate 1 — (1 + 7 Q )7b, over a network similar to Theorem |5j is larger than (1 + j c ) (l + (1 + 7 a )7b + 0(7?)) ^, 
w.p. b.a.b. e, so long as 

f L L 

a = £1 — log 

V7c 7b 7c 

and a 2 /7a7b = °(^/l°g(V e ))> where < 7 a ,7b,7c < 1 are arbitrary constants. 

In the special case of unequal traffic parameters, the coding delay and the average coding delay of CC with 
precoding can be upper bounded as follows. The proofs follow the same line as in the general case except that a 
new set of conditions needs to be satisfied based on the assumption that no two traffic parameters are equal. 

15 The precode does not have to be capacity-achieving and its rate can be arbitrarily close to 1 — (1 + 7 a )7i,, yet, it has to be able to correct 
up to a fraction (1 + 7a)7i> of erasures (for more details, see |6|). 
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Theorem 11: The coding delay of a CCP with chunks of size a and a capacity-achieving erasure code of rate 
1 — (1 + jajjb, over a line network of L links with deterministic regular transmissions and Bernoulli losses with 
unequal parameters {pi} is larger than (1 + j c ) (l + (1 + 7 a )7& + 0(7^)) |, w.p. b.a.b. e, so long as 

VIA 7c 767c/ V7e 7e7fc/J/ 

and a 2 /7a7b = o(fc/log(l/e)), where < 7 a ,7&>7c < 1 are arbitrary constants, p = mini<i<iPj, 7 e = 
mini <i < L 7e i , and j ei = \p t - Pi-i\. 

Proof: Let us assume pi > p 2 > ■■■ > Pl, without loss of generality. Let p = mhii<i<LPi, 7e = 



mini<i<L 7 6l , and 7 Gi = \pi ~Pi-i\. Let n = (l- 7*)^, where ipi = p,N T /wq and 7* ~ ^/(l/^) log(w T /7&)> 
and < 7b < 1 is an arbitrary constant. For every 1 < i < L and 1 < j < w — L + 1, let ifij be the number 
of packets (pertaining to a given chunk) in the partition Jy (the j th partition pertaining to the i th link), where the 
time interval (0, Nt] is split into w partitions of length Nt/w, and let (fi be the expected value of cpij. For all 
suppose that ip^ is larger than or equal to n. Let Nt = (1 + lc)k/p, where < 7 C < 1 is an arbitrarily small 
constant. By replacing Nt with (1 + j c )k/p, it follows that ifi = (1 +j c )pia/pw, and <p — 0(1), similar to those 
in the proof of Theorem [9] 

For every 1 < i < L and 1 < j < w — L + 1, let V(Ql), T> p (Q\), and r,j be defined as in the proof of 
Theorem [5] For every 1 < j < w — L + 1, T)(Q{) > rij (since all the packets pertaining to any chunk over the 
first link are globally dense). For every 1 < i < L and 1 < j < w — L + 1, by applying Lemma [3] it can be shown 
that the inequality V(Q :, i ) > nj fails w.p. b.a.b. ijjb/wT, so long as 

a^feog^V (29) 
V7S 7fc / 

Let 95, 7* and r denote (p^, 7^ and r^, respectively. Thus, the number of dense packets pertaining to a given 
chunk at the sink node fails to be larger than 

(1 + lc )a - O - O (Jaw log ^ . (30) 

We select w to be 



aL 2 



y iog(w T /7 fc ) 

to maximize (f30|> subject to condition d29]l. For this choice of w, condition ((29]i is met so long as 



( '.A'g ) (3!) 

7e 7e76 



By replacing 7^ with 7, in the preceding results, and substituting the selected value of w in ( |30] >, the result of 
Lemma [7] shows that the sink node fails to decode a given chunk w.p. b.a.b. 7b, so long as pO) is larger than 
a + log(l/7b). Based on the properties of the notation £!(.), the latter condition is met so long as 

a^nf^log— ). (32) 
V7c 7&7c/ 

The rest of the proof is similar to the proof of Theorem [9] except that in this case conditions pT) and ([32| need 

to be met, instead of conditions d26ll and d27ll. □ 
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Theorem 12: The average coding delay of a CCP with chunks of size a and a capacity-achieving erasure code 
of rate 1 — (1 + "f a )jb, over a network similar to Theorem |TT| is larger than (1 + j c ) (l + (1 + "/ a )lb + 0(7?)) ^> 
w.p. b.a.b. e, so long as 

a = [ -^log — 

\lilc 767c 

and a 2 /7a7b = o(fe/log(l/e)), where < 7 a ,7fc,7c < 1 are arbitrary constants. 



Proof: The proof follows the same line as that of Theorem 1 1 except that the choice of w needs to maximize 

(l + lc )a-o( — ) (33) 



subject to condition ( |29] l. To do so, the choice of w needs to be fl(L/j c ), and hence, condition p9| ) becomes 

a = 0(^log— J. □ 



IV. Poisson Transmissions and Bernoulli Losses 

In the case of Bernoulli losses and Poisson transmissions with parameters {pi]\<i<L and {Aj}i<i<i, the points 
in time at which the arrivals/departures occur over the i th link follow a Poisson process with parameter XiPi. Thus 
the number of packets pertaining to a given chunk (note that a dense code is a CC with only one chunk), in each 
partition pertaining to the i th link, has a Poisson distribution with the expected value XiPiNx/wq. Since the result of 
Chernoff bound also holds for Poisson random variables (see fT2| Theorem A. 1.15]), the main results in Section [XTTI 
apply to this case by replacing p with mini<i<L A;Pj. 

V. Discussion 

A. Dense Codes 

The upper bounds on the coding delay and the average coding delay, derived in this paper, are valid for any 
arbitrary choice of e. However, in the following, to compare our results with those of |7| and [8|, we focus on the 
case where e goes to polynomially fast, as k goes to infinity (i.e., e = l/k c , for some constant c > 0). For such a 
choice of e, the upper bounds on the coding delay and the average coding delay hold w.p. 1, as k goes to infinity. 

In (7j, the average coding delay of dense codes over the networks of length 2 with deterministic regular 
transmissions and Bernoulli losses with equal parameters (p) is shown to be upper bounded by ^(k + 0(^/k logfc)). 
The result of Theorem [2] indicates that the average coding delay of dense codes over the networks of length L 
with similar traffics as above (i.e., the special case of identical links with equal parameters is upper bounded 
by + (1 + o(l))(y/kL\og(kL))). This is consistent with the result of [7], although the bound presented here 
provides more details on the smaller terms in the 0(.) term. 

16 One should note that Theorems [l] and [2] are not restricted to the special case of identical links, and hold true for any arbitrary sequence of 
parameters. 



July 26, 2012 



DRAFT 



25 



The result of Theorem [T] suggests that the coding delay of dense codes over network scenarios as above is upper 
bounded by ^(k + (1 + o(l))(k 2 L\og(kL))i). One should note that there has been no result on the coding delay 
of dense codes over identical links in the existing literature. In fact, this was posed as an open problem in [8]. It is 
also noteworthy that unlike the analysis of fSJ, our analysis does not rely on the existence of a single worst link, 
and hence is applicable to the special case of identical links. 

In (8), the average coding delay of dense codes over the networks of length L with deterministic regular 
transmissions and Bernoulli losses with parameters {pi} was upper bounded by | where p = minjpj 

is the unique minimum and v = argmiriipj. This result was derived under the (impractical) assumption that the 
size of the finite field over which the coding scheme operates is infinitely large. 

Related to this result, Theorem [2] or Theorem [4] indicate that the average coding delay of dense codes over 
line networks with traffics as above, but with arbitrary or unequal parameters^ is upper bounded by ^{k + (1 + 
o(l))(-\/fcL log(fcL))), or (1 + o(l))(Lf(k) log(/cL))), respectively, where f(k) goes to infinity sufficiently 

slow (see Theorem |4j, as k goes to infinity. It is important to note that both Theorems [2] and [4] do not have the 
limiting assumption of the result of J8j regarding the size of the finite field. The bounds of Theorems [2] and [4] are 
larger than that of [8], which is expected, since the former, unlike the latter, are derived based on the practical 
assumption of operating over a finite field of size as small as two. 

The results of Theorems [T] and [3] indicate that for both traffics with arbitrary or unequal parameters, the coding 
delay is upper bounded by ^(k + (1 + o(l))(k 2 L\og(kL))i). This is while, in |8j, the coding delay is upper 
bounded by ^(k + O(ki)). This bound is looser than the bound in Theorem jlj or the one in Theorem^ although 
it is derived under the same limiting assumption as the one used in [8] for the average coding delay (i.e., the size 
of the finite field being infinitely large). Such an assumption makes the bound appear smaller than what it would 
be at the absence of the assumption. This demonstrates the strength of the bounding technique used in this work. 

By combining Theorems [T] and [2] or Theorems [3] and [4j it can be seen that the coding delay might be much 
larger than the average coding delay. This highlights the fact that the analysis of the average coding delay does not 
provide a complete picture of the speed of convergence of dense codes to the capacity of line networks. 

B. Chunked Codes 

Table [i] shows the upper bounds^] (w.p. of failure b.a.b. e) on the overhead and the average overhead (i.e., the 
difference between the coding delay or the average coding delay and the capacity) of CC over various traffics for 



different ranges of the size of the chunks based on the results in Section III and those in [6]rnThe traffics under 



"The special case of traffic parameters with a unique minimum can fall into each category of arbitrary or unequal traffic parameters. For 
example, aside from the uniqueness of the parameter with the minimum value, some other parameters might be equal, and hence such a case 
does not belong to the category of unequal parameters but it does belong to the category of arbitrary parameters. 

18 With a slight abuse of language, we refer to the "upper bound" on the overhead or the average overhead as the "overhead" or the "average 
overhead." 

"The results of Section 



III-B1 



and those of Section 



III-B2 



were stated in terms of q and a, respectively. In this section, for the ease of 



comparison, the former results are also restated in terms of a by replacing q with k/a. 
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TABLE I 

Comparison of Chunked Codes over Line Networks with Various Traffics 



Traffic 


Success 
Parameters 


Overhead (rj) 
and 

Average Overhead (ff) 


Size of Chunks 
(a) 


to 


Comments 


Arbitrary 
Deterministic 




v = fj = O (kL 




W ( 


L3 log M 


) 




m = ^ log ( 

a \ at / 

<5 = min | ^ , l| 

lei = \pi ~Pi-l\ 

7 e = min Kl < L 7 e; 
p = minimi pi 


Deterministic 

Regular 
Transmissions 
and 
Bernoulli 
Losses 


Arbitrary 


1 = J ((1 + o(l)) (~f +ki m h + m )) 












1 

3 


*?=K (1 + ° (1)) (^ +m )) 






1 

2 


Unequal 


7?=±((1+0(1)) 














1 

3 






u (/(fc) ■ f log 


1 / a \ 

7W) l^iogif ; 



consideration are: arbitrary deterministic traffics, and traffics with deterministic regular transmissions and Bernoulli 
losses. We refer to the latter traffics as the probabilistic traffics for simplification]^] The probabilistic traffics are 
categorized into two sub-categories: traffics with arbitrary parameters and traffics with unequal parameters. We say 
that a code is "capacity-achieving" (c.-a.) if the ratio of the overhead to the capacity goes to 0, as k goes to infinity. 
Similarly, a code is "capacity-achieving on average" (c.-a. a.) if the ratio of the average overhead to the capacity 
goes to 0, as k goes to infinity. In Table [I] the upper (or the lower) row in front of each case of traffic parameters 
corresponds to a c.-a. (or a c.-a.a.) scenario. 

In the table, one can see that, for each category of traffics, the size of the chunks (a) has to be sufficiently large 
so that CC are c.-a. or c.-a.a.. For arbitrary deterministic traffics, the lower bound on a is super-logarithmic in k, 
i.e., w(logfc), and super-log-cubic in L, i.e., lj(L 3 log L). For the probabilistic traffics with arbitrary or unequal 
parameters, the lower bound on a has a similar growth rate with fc, but a smaller (super-log-linear) growth rate with 
L, i.e., uj(L\og L). The coding cost of CC (i.e., the number of the coding (packet) operations per message packet), 
is, on the other hand, linear in a. Thus, CC can perform as fast over both the arbitrary deterministic traffics and 
the probabilistic traffics, but with a lower coding cost (smaller chunks) in the latter case compared to the former. 

Moreover, as it can be seen in Table [I] for both arbitrary deterministic and probabilistic traffics (in each case of 
arbitrary or unequal traffic parameters), the overhead grows sub-log-linearly with fc, i.e., 0(fclog 3 fc), and decays 
sub-linearly with a, i.e., (9(l/as). However, for arbitrary deterministic traffic, the overhead grows with 0(L log 3 L), 
and for the probabilistic traffics, it only grows with 0(L^ log 3 L). This implies a faster speed of convergence to 
the capacity in the latter case compared to the former. Similar comparison results can also be observed in terms of 
the average overhead, except that in the case of unequal traffic parameters, the average overhead decays linearly 
with a, i.e., 0(1 /a), but grows poly-log-linearly with fc, i.e., 0(fclog 2 fc), for the choice of /(fc) = O(logfc), and 

20 In the case of arbitrary deterministic traffics, the capacity is equal to k, and in the case of probabilistic traffics with parameters {pi}i<i<L. 
the capacity is equal to k/p, where p = mini<i<^pi. 
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TABLE II 

Comparison of Chunked Codes with Precoding over Line Networks with Various Traffics 



Traffic 


Success 
Parameters 


Overhead (rj) 
Average Overhead (ff) 


Size of Chunks 
(a) 


Comments 


Arbitrary 
Deterministic 




rj = fj = ~f a k 


Q 


(f lo s 


L 

767c , 


) 




< 7a, 7b: 7c < 1 
{7a, 7b, 7c} =0(1) 
7o =7c + (l + 7c)7o 
7i = (l + 7a)7i, + 0( 7b 2 ) 

7e, = \Pi ~Pi-l\ 

7 e = min Kl < L 7 ei 
p = mini <i<£Pi 


Deterministic 

Regular 
Transmissions 
and 
Bernoulli 
Losses 


Arbitrary 


^ = 7o| 




767c 




( £4fc «i)}) 
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log-linearly with L, i.e., O(ZlogL). 

Table [TT] shows the results for CC with precoding (CCP) in the scenarios similar to those considered in Table [I] 
where the precode is a capacity-achieving erasure code of dimension k and rate 1 — (l+7 )7&- in particular, one can 
see that CCP are "capacity-approaching" or "capacity-approaching on average" with an arbitrary small "non-zero 
constant" gap j Q (i.e., the ratio of the overhead or the average overhead to the capacity goes to j Q , as k goes to 
infinity) if a is sufficiently large. For simplifying the terminology, we drop the term "with a non-zero constant gap." 
The upper (or the lower) row in front of each case of traffic parameters corresponds to a capacity-approaching (or 
a capacity-approaching on average) scenario. For arbitrary deterministic traffics, the lower bound on a is constant 
in k, and log-cubic in L, i.e., (9(L 3 log L). For the probabilistic traffics with arbitrary or unequal parameters, the 
lower bound on a is also constant in k, but has a smaller (log-linear) growth rate with L, i.e., O(LlogL). Thus, in 
the case of CCP, one can make a conclusion similar to the one made in the case of stand-alone CC, with respect 
to the arbitrary deterministic and the probabilistic traffics. 
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